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Hominoid fission of chromosome 14/15 and the role 
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Ape chromosomes homologous to human chromosomes 14 and 15 were generated by a fission event of an ancestral 
submetacentric chromosome, where the two chromosomes were joined head-to-tail. The hominoid ancestral chromosome 
most closely resembles the macaque chromosome 7. In this work, we provide insights into the evolution of human 
chromosomes 14 and 15, performing a comparative study between macaque boundary region 14/15 and the orthologous 
human regions. We construct a 1.6-Mb contig of macaque BAC clones in the region orthologous to the ancestral hominoid 
fission site and use it to define the structural changes that occurred on human 14q pericentromeric and 15q subtelomeric 
regions. We characterize the novel euchromatin-heterochromatin transition region [-20 Mb) acquired during the 
neocentromere establishment on chromosome 14, and find it was mainly derived through pericentromeric duplications 
from ancestral hominoid chromosomes homologous to human 2ql4-qter and 10. Further, we show a relationship between 
evolutionary hotspots and low-copy repeat loci for chromosome 15, revealing a possible role of segmental duplications not 
only in mediating but also in "stitching" together rearrangement brealcpoints. 

[Supplemental material is available for this article.] 



Human chromosomes 14 and 15 were generated by fission of an 
ancestral chromosome composed of chromosome 14 attached, 
head-to-tail, to chromosome 15 around 25 million years ago 
(Ventura et al. 2003). The rearrangement occurred in the hominoid 
ancestor and involved a cluster of olfactory receptor (OR) genes 
(Rudd et al. 2009). At the point of fission, the 15q telomere and the 
neocentromere of chromosome 14 were seeded (Ventura et al. 2003). 
The 15q subtelomeric region experienced numerous structural rear- 
rangements, including interstitial deletions and transfers of material 
to and from other subtelomeric regions (Rudd et al. 2009). 

Low-copy repeats (LCR), also named segmental duplications 
(SDs) (Bailey et al. 2001; Cheung et al. 2001), mediate chromosomal 
rearrangements through non-allelic homologous recombination 
(NAHR) events (Hastings et al. 2009) and are enriched at pericen- 
tromeric and subterminal regions of the human genome (Bailey and 
Eichler 2006). LCR15 (low-copy repeats on human chromosome 15) 
loci have been widely studied (Pujana et al. 2001; Zody et al. 2006); 
those of primary focus are on 15qll-ql3 because of their in- 
volvement in the Prader-Willi and Angelman syndromes (Pujana 
et al. 2002). 

In this work, we provide insights into and highlight the role 
of LCR in the evolution of human chromosomes 14 and 15. We 
assembled a 1.6-Mb contig of macaque BAC clones encompassing 
the 14/15 boundary and tracked evolutionary changes initiated by 
the fission in the human lineage. We detected duplications from 
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the newly seeded subtelomeric region on chromosome 15 and 
pericentromeric region on chromosome 14 to other existing sub- 
telomeric and pericentromeric regions, respectively. Further, we 
traced the dynamics of the formation of the chromosome 14 
neocentromere and demonstrated the recruitment of new peri- 
centromeric sequences mainly from the pericentromeric regions of 
ancestral chromosomes orthologous to human 2ql4-qter and 10. 
We found a correlation between clusters of LCR15 and evolu- 
tionary "hotspot" regions. (Note: For simplicity we used the or- 
thologous numbering of great ape and human chromosomes as 
proposed for great ape genome sequencing projects [Supplemental 
Fig. SI; McConkey 2004].) 

Results 

Evolutionary history of chromosomes 14 and 15 

We reconstructed the evolutionary history of chromosomes 14 and 
15 in the primate lineage from published data (Yunis et al. 1980; 
Yunis and Prakash 1982; Locke et al. 2003; Ventura et al. 2003; 
Zody et al. 2006; Stanyon et al. 2008; Locke et al. 2011; Ventura 
et al. 2011) and data from the marmoset genome sequencing 
project (Marmoset Genome Sequencing Consortium, unpubl.). 
The synteny of chromosomes 14/15 is ancestral to placental 
mammals (Wienberg and Stanyon 1995, 1997; Kent et al. 2002; 
Pontius et al. 2007; Wade et al. 2009; Meyer et al. 2013), as dem- 
onstrated by cat chromosome B3, horse chromosome 1, and rabbit 



© 201 3 Giannuzzi et al. This article is distributed exclusively by Cold Spring 
Harbor Laboratory Press for the first six months after the full-issue publication 
date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it 
is available under a Creative Commons License (Attribution-NonCommercial 
3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/ 
3.0/. 



23:1763-1773 Published by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/13; www.genome.org 



Genome Research 1 763 

www.genonne.org 



Giannuzzi et al. 



chromosome 17 (UCSC Genome Browser alignment net, http:// 
genome.ucsc.edu/). In the primate ancestor, chromosomes 14 and 
15 were a submetacentric chromosome with a small p arm (Fig. 1). 
In the marmoset lineage, one fission and two inversions derived 
marmoset chromosomes 6p and 10, with 6p maintaining the an- 
cestral association of A-F-E markers (Zody et al. 2006). A pericentric 
inversion in the Catarrhini ancestor generated a submetacentric 
chromosome with the same organization as the rhesus macaque 
chromosome 7 (MMU7). In the Hominoidea ancestor the sub- 
metacentric chromosome underwent a fission event and gen- 
erated the acrocentric chromosomes 14 and 15. The ancestral 
centromere was inactivated and two neocentromeres emerged 
(Ventura et al. 2003). Finally, a pericentric inversion occurred in 



the gorilla and chimpanzee lineages, respectively on chromo- 
somes 14 and 15 (Yunis et al. 1980; Yunis and Prakash 1982; 
Locke et al. 2003; Ventura et al. 2011). 

Comparative analysis of the human and macaque genomic 
regions orthologous to the hominoid 14/15 fission site 

MMU7 has a chromosomal organization resembling the ancestral 
hominoid 14/15 association and shows perfect marker order con- 
servation with respect to the two human-derivative chromosomes 
(Ventura et al. 2003). To gain insights into the genomic organiza- 
tion of the hominoid fission region, we constructed a 1.6-Mb 
contig of 61 macaque BAG clones on MMU7 spanning the 14/15 
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Figure 1 . Evolutionary history of chromosomes 1 4 and 1 5. Ten segments (A-J) are used to track the chromosomal rearrangements and are defined as in 
Stanyon et al. (2008). The following abbreviations are used: A in red, ancestral centromere; N in red, evolutionary neocentromere; (HSA) Homo sapiens; 
(PTR) Pan troglodytes; (GGO) Gorilla gorilla; (PPY) Pongo pygmaeus; (MMU) Macaca mulatta; (CJA) Callithrix jacchus. In Figures 1, 2, and 5, chromosomes 
1 4 and 1 5 are colored according to the color code of the UCSC Genome Browser (dark blue for chromosome 1 4 and light blue for chromosome 1 5). The 
NOR (nucleolus organizer region) bearing short arm is drawn with a horizontal black/white striped box. 
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boundary orthologous to human 14qll and 15q26.3 (Fig. 2; 
Supplemental Table SI). We screened the macaque genomic li- 
brary, CHORI-250, using STS derived from the human 15q sub- 



terminal (Hsal5a-d) and 14q pericentromeric (Hsal4a-c) regions 
(hgl8/build36) together with macaque-specific STS derived from 
CH250 BAG end sequences (BBS). The corresponding region was 
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Figure 2. Contig of CH250 BAC clones on MMU7. The schematic shows features of the macaque genomic region spanned by the contig 
(chr7:81 552000-831 32000, rheMac2). In particular, gap and STS locations, BAC clones, FISH mapping data, chained alignments with hglS reference, 
WSSD (whole-genome shotgun sequence detection) depth-of-coverage data (the green color corresponds to single copy; the red color to duplication; 
Marques-Bonet et al. 2009), macaque RefSeq genes, and human RefSeq genes in the orthologous human regions are shown. STS names are color-coded: 
STS used in radioactive hybridization (orange), STS used in PCR amplification (blue), and STS used in both assays (green). FISH results on MMU7 and 
human chromosomes 14/15 are shown for representative BAC clones marked by names and arrowheads in red. Seven different FISH patterns are 
identified: BAC clones of Groups F1-F3 map to the homologous region on chromosome 15 (light blue); BAC clones of Groups G2-G4 map to the 
homologous region on chromosome 1 4 (dark blue); and gray-colored BAC clones (Group G1 ) showed absent or weak signal on human chromosome 1 4 
and correspond to the fission breakpoint. An asterisk in front of the BAC name indicates BAC clones belonging to the second copy of the macaque 
duplication. A yellow arrowhead marks BES not anchored to the human reference hgl 8. Additional FISH signals on other human chromosomes due to the 
presence of interchromosomal SDs (further indicated by the chained alignments) are shown for the BAC clones of Groups F3, G2, and G3. An empty black 
box marks the human GOLGA2-ITSN2 core duplicon. Both end segments of the contig (Groups F1 and G4) are in single copy and direct orientation in 
macaque and human. 
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single copy in macaque, except for a 134-kb segment duplicated in 
tandem. We FISH- (fluorescence in situ hybridization) mapped all 
clones on both macaque and human chromosome metaphases 
and grouped them accordingly by FISH pattern (F1-F3 clones map 
to human 15q26.3; G1-G4 clones map to human 14qll); most of 
the clones were single copy in macaque and duplicated intra- and/ 
or interchromosomally in human (Supplemental Note). We gen- 
erated 91 BBS and mapped them on the macaque (rheMac2) and 
human (hgl8/build36) reference genome assemblies. We found 
consistent placements on rheMacZ, except for those partly cover- 
ing assembly gaps. In the human reference assembly several BACs 
were one-end anchored and some end sequences showed multiple 
matches due to the presence of SDs in the human genome (Sup- 
plemental Table SI). 

Extreme groups (Fl and G4) are made up of clones spanning 
single copy and directly oriented segments in human and ma- 
caque. For the homologous region on chromosome 15, we iden- 
tified an LCR15 copy in macaque and human genomes (Group F2) 
and human subterminal duplications of the last segment pre- 
serving the same centromere-telomere orientation except for Ip 
(Group F3) (Supplemental Note). We detected clones (Groups F3 
and Gl) covering the evolutionary fission breakpoint as revealed 
by the BAG end mapping (one-end anchored on human chromo- 
somes 14 or 15) and absence of signal on human metaphases for 
three clones. For the homologous region on chromosome 14, we 
detected three human interchromosomal pericentromeric dupli- 
cations (15qll, 18pll, and 21qll; Groups G2 and G3) dated to 
have duplicated prior to human-orangutan divergence (Supple- 
mental Note). Furthermore, we identified a hominoid-specific in- 
version and macaque tandem duplication with 99.85% identity 
(Fig. 2) terminally flanked by human and macaque single copy as 
well as human/macaque direct-oriented regions. This suggests that 
the hominoid inversion and 14-15 duplication, as well as the 
macaque tandem duplication, end at the same position. Gene 
content analysis showed the presence of OR genes restricted to the 
duplicated regions in human and macaque (Fig. 2; Supplemental 
Table SI; Supplemental Note). 

Relationships between the LCR15 and chromosome 
15 evolutionary hotspots 

Since we found an LCR15 copy in proximity to the fission point, 
we investigated the relationships between the LCR15 {ITSN2/ 
DNM1-G0LGA2 duplications) (Supplemental Note) and chromo- 
some 15 evolutionary breakpoints in the primate lineage using 
molecular cytogenetic and phylogenetic analyses. We performed 
three-color FISH experiments on human, macaque, and marmoset 
chromosome metaphases of 10 RPll BAG triplets designed at 
15qll.2-ql4 and 15q24.2-q25.2 in addition to a CH250 BAG 
triplet for LCR15 at 15q26.3 (Fig. 3). We identified six LCR15 loci 
corresponding to evolutionary hotspots (named AB, Dl, D2, DEI, 
DE2, and F) and the LCR15 cluster at the novel pericentromeric 
region (15qll.2, locus A). (Note: the loci nomenclature links up 
the evolutionary history markers in Fig. 1.) Loci AB and F are 
orthologous to the evolutionary Gatarrhini inversion breakpoints 
and show orthologous LGR15 duplications in human and ma- 
caque, but not in marmoset, whereas locus DEI is orthologous to 
the marmoset fission breakpoint and has orthologous LGR15 
copies in all three species (Figs. 1,3). BAG clones at 15q24-q25 (loci 
Dl, D2, DEI, and DE2) mapped to the pericentromeric region of 
both the macaque and marmoset chromosomes, orthologous to 
the ancestral hominoid pericentromeric region. Noteworthy, BAG 



clones spanning LCR15 at loci Dl and D2 exclusively showed 
a tandem duplicated signal in all three species, comprising the 
marmoset lOp pericentromeric region. Alternatively, three-color 
FISH experiments using BAGs for the duplication clusters at the 
remaining loci did not show any break of synteny or pericen- 
tromeric localization in the three species analyzed (data not 
shown). 

G0LGA2 and ITSN2 neighbor- joining trees reported mono- 
phyletic clades of duplications mapping to orthologous locations 
of the ape pericentromeric region (human 15qll.l-ql3.1), the 
macaque and marmoset pericentromeric region (human 15q24- 
q25), and the Gatarrhini inversion breakpoints (human 15qll.2- 
ql3.3 and 15q26.3) (Fig. 4). Marmoset LGR15 copies map to 
chromosomes 6 and 10, as well as to unassembled scaffolds, and 
are monophyletic (bootstrap values equal to 80 and 100), likely 
because they homogenized. The ortholog to the human chro- 
mosome 10 copy is annotated in ape and macaque genomes; 
however, no copy is annotated in the marmoset genome though 
the assembly of the region appears complete (Supplemental 
Fig. S2). 

Study of human chromosome 14 newly minted 
pericentromeric region 

To analyze human 14q pericentromeric sequence acquired af- 
ter the fission event, we followed two different approaches: (1) 
screening of macaque high-density filters using three STS at 
chrl4:19 Mb (hgl8) (Supplemental Table S3); and (2) FISH map- 
ping comparison in human, orangutan, and macaque of an RPll 
BAG clone tiling path spanning the entire assembled sequence. In 
the first approach, we mapped macaque positive BAG clones on 
both human and macaque by FISH, BES alignment on reference 
genome assemblies (rheMac2 and hgl8), and sequencing of rep- 
resentative clones using Illumina technology (Supplemental Table 
S2). We arranged positive clones into six groups (I-VI) according 
to the main FISH signals on macaque chromosomes and FISH- 
mapped one clone per group on chimpanzee, gorilla, and orang- 
utan chromosomes (Supplemental Fig. S7; Supplemental Table S3). 
The majority of macaque clones (Groups I-FV) mapped in all primate 
species to human 2q21 orthologous location, an ancestral primate 
pericentromeric region (Yunis and Prakash 1982; Wienberg et al. 
1994; Roberto et al. 2008), and showed interchromosomal pericen- 
tromeric duplication in apes. Of note. Group IV reported a tandem 
duplicated signal at the macaque 9q pericentromeric region (ortho- 
logous to human chromosome 10), but orthologous FISH localiza- 
tion was identified only in orangutan chromosomes. The end 
sequences of this group had inconsistent or no homologous 
placement on both the macaque and human genome references, 
suggesting that the macaque 9q pericentromeric sequence is in- 
complete in the macaque reference and differs significantly from the 
human lOq pericentromeric sequence. This finding was confirmed 
by the Illumina sequencing of two clones (GH250-48F1 and GH25 0- 
34G6). On the other hand, Group V and VI clones mapped to a 
nonpericentromeric region (human 10q22.3 orthologous loca- 
tion) and to a different pericentromeric region (human 12pll 
orthologous location), respectively. At human 14qll, clones showed 
signal (Groups II, III, and IV), faint signal (Groups I and V), or no 
signal (Group VI). 

We found a similar pattern by a reciprocal approach using 
human BAG clones (Supplemental Fig. S8). In human, 14q peri- 
centromeric sequence (tandem duplicated signal at this locus) 
showed interchromosomal pericentromeric duplication mainly 
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Figure 4. Primate phylogeny of C0LCA2 and ITSN2 duplicons. Neighbor-joining phylogenetic trees were constructed using a 2.5-kb region of the 
C0LGA2 module and a 3.3-kb region of the ITSN2 module. Sequences were retrieved from human (hg19), chimpanzee (panTroS), gorilla (gorGorS), 
orangutan (ponAbe2), macaque (rheMac2), and marmoset (calJacS) reference genomes. Trees are drawn to scale, with branch lengths measured in the 
number of substitutions per site and bootstrap values (1 000 replicates) shown. Monophyletic clades of duplicons mapping to the ancestral loci (9qS4.1 1 
and 2p2S.S for G0LGA2 and ITSN2, respectively), chromosome 10, marmoset duplications, macaque and marmoset pericentromeric region, inversion 
breakpoints, and the pericentromeric region of ape chromosome 1 5 are highlighted in color code. The copies at the proximal inversion breakpoint and ape 
pericentromeric region homogenized, especially in the ITSN2 phylogenetic tree, do not group into two different branches. 



1 768 Genome Research 



www.genome.org 



Human chromosomes 14 and 15 evolution 



with chromosome 22. Of note, all clones hybridized to human 
2q21 and the orangutan orthologous locus along with the ma- 
caque 9q pericentromeric region. In orangutan they showed 
interchromosomal pericentromeric duplicated signals (in two 
cases at lOq); in addition, four clones showed a tandem duplicated 
signal on macaque chromosome 10. 

SD data (Bailey et al. 2001, 2002a) for this region (Table 1) 
reported that (1) 76% of the duplications (93% of the total length) 
are shared with ancestral hominoid acrocentric chromosomes 
(13, 14, 15, 21, and 22 together with 2A, 2B, 9, 11, and 18, which 
are no longer acrocentric in human; Supplemental Note); and 
(2) 58% of sites (79% of the total length) map to the q arm 
pericentromeric region of these chromosomes (13qll, 14qll, 
15qll, 21qll, and 22qll together with 2q21, 9pll, and 18pll, 
which are orthologous to ancestral q arm pericentromeric re- 
gions). The 2q21 locus shows the greatest homology in terms of 
both the number and the sequence length of shared duplica- 
tions. Among the other chromosomal positions, the highest 
values of SD are with 9ql2 (chromosome 9 heterochromatin) 
and chromosome 10. 



Table 1. SDs of the pericentromeric region of human 
chromosome 14 



Segmental duplications 


N 


Percent 


Size (bp) 


Percent 


Total 


245 


100% 


5,613,798 


100% 


Total on HAC 


186 


76% 


5,241,374 


93% 


Total on q1 1 of HAC 


141 


58% 


4,41 8,845 


79% 


9p11 


7 


3% 


86,423 


2% 


13q11 


5 


2% 


75,655 


1% 


14ql1 


16 


7% 


619,901 


11% 


ISqII 


21 


9% 


440,195 


8% 


ISpll 


12 


5% 


336,842 


6% 


2q21 (129.6-132.7 Mb) 


61 


25% 


1,61 3,843 


29% 


21q11 


8 


3% 


293,574 


5% 


22q11 


11 


4% 


952,412 


17% 


Total (other) 


97 


40% 


1,087,017 


19% 


chrl 


3 


1% 


12,334 


0% 


2p1 1 (other) 


3 


1% 


14,011 


0% 


chr4 random 


3 


1% 


40,732 


1% 


chr7 


4 


2% 


6932 


0% 


chr9q12 


22 


9% 


555,260 


10% 


chr9 (other) 


3 


1% 


49,962 


1% 


chrlO 


21 


9% 


1 77,995 


3% 


chrll (other) 


3 


1% 


4448 


0% 


chrl 2 


7 


3% 


27,374 


0% 


1 3q (other) 


7 


3% 


90,912 


2% 


chrl 6 


8 


3% 


19,390 


0% 


chrl 7 and chrl 7 random 


10 


4% 


55,255 


1% 


chrY 


3 


1% 


32,412 


1% 


Unclassified 










chr21_random 


6 


2% 


104,873 


2% 


chr22_random 


1 


0% 


3063 


0% 



Number and total length of SDs classified according to the chromo- 
somal location of the duplicate copy. Those mapping to chromosomes 
orthologous to ancestral hominoid acrocentric chromosomes (HAC, 
chromosomes 2A, 2B, 9, 1 1, 1 3, 14, 15, 18, 20, 21, and 22) and to 
regions orthologous to their q arm pericentromeric region are distin- 
guished from those mapping to other chromosomal locations. Two 
different locations, q1 1 of the ancestral acrocentric chromosome and 
"other", were considered for these chromosomes. Regarding chro- 
mosome 9, the location 9ql2 was also distinguished. Chr21_random 
and chr22_random are unclassified because the location (21qll or 
chr21 [other], 22q1 1 orchr22 [other]) for these duplications could not 
be assigned. 



Discussion 

Reshuffling of the hominoid OR gene repertoire after 
the 14/ 15 fission 

An ancestral liominoid fission event gave rise to iiuman ciiromo- 
somes 14 and 15 and converted a single-copy euchromatic region 
from an interstitial to a terminal location. Since the genomic or- 
ganization of the macaque region orthologous to the fission site 
closely resembles the ancestral hominoid one, we constructed 
a 1.6-Mb macaque BAG clone contig and compared the content 
and organization of human and macaque. Sequences from the 
novel 15q subtelomeric and 14q pericentromeric regions were 
extensively and specifically duplicated to other subtelomeric and 
pericentromeric regions, respectively, probably as a direct conse- 
quence of the evolutionary change in chromosomal location and 
consistent with the SD enrichments observed for these regions in 
the human genome (Bailey et al. 2001; Linardopoulou et al. 2005; 
Bailey and Eichler 2006). Notably, we found that 14q pericentro- 
meric sequences were duplicated to the q arm pericentromeric re- 
gion of other acrocentric (15 and 21) or ancestrally acrocentric (18) 
(Dennehey et al. 2004) chromosomes and subsequently experi- 
enced duplication exchanges mainly with acrocentric or an- 
cestrally acrocentric chromosomes (Table 1). These observations 
suggest that, first, the pericentromeric region of acrocentric chro- 
mosomes is rather preferentially duplicated with other acrocentric 
chromosomes and, second, these exchanges occur primarily be- 
tween the q arms possibly because the presence of the rDNA and 
other repetitive sequences prevent the "invasion" of the p arm, as 
previously reported in chimpanzee and gorilla telomeric ends 
(Ventura et al. 2012). 

Clones positive to the STS 265F14t7 showed weak FISH sig- 
nals and single-end sequence placement to human 15q26.3 or 
14qll (Fig. 2; Supplemental Table SI), consistent with the occur- 
rence of the fission event and sequence loss at this site, as pre- 
viously shown for clone CH250-246C20 (Rudd et al. 2009). The 
fission and consequent chromosomal rearrangements shuffled the 
OR gene repertoire of hominoids. In particular, the fission caused 
the loss of OR gene copies at the breakpoint (Rudd et al. 2009) 
while the consequent duplication events created additional copies 
in other subterminal and pericentromeric regions. We observed a 
correlation between the presence of OR genes and the involvement 
in chromosomal rearrangements (deletion, duplication, and 
inversion) — all regions that underwent rearrangement in human 
or in both human and macaque genomes embedded OR genes 
(besides the ITSN2/DNM1-G0LGA2 duplicon) whereas the flank- 
ing single-copy regions contained few OR genes and genes other 
than OR. OR genes form the largest mammalian gene family and 
have been subjected to extensive pseudogenization particularly 
in the primate lineage. Not surprisingly, they have been contin- 
uously reshuffled during mammalian evolution by frequent du- 
plication and deletion events (Niimura and Nei 2007). In this 
regard, their presence at the novel subtelomeric and pericentro- 
meric regions and subsequent duplications unlikely affected the 
species' survival, and the fission event and its consequences were 
likely neutral. 



ITSNl/DNMI-GOLGM duplication clusters map 
to evolutionary hotspots 

Multiple studies demonstrated the association of SDs and evolu- 
tionary rearrangements in primates (Armengol et al. 2003; Locke 
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et al. 2003; Bailey et al. 2004; Murphy et al. 2005). In this study, we 
further prove this association, specifically between LCR15 and 
regions involved in evolutionary rearrangements. We show that 
LCR15 sequences cluster around (1) the evolutionary new cen- 
tromere (15qll.2, ~3 Mb away from the centromere); (2) the 
breakpoints of the pericentric inversion that occurred in the 
Catarrhini ancestor (15ql3 and 15q26.3); and (3) the ancestral 
centromere (15q24-q25, <1 Mb from the ancestral centromere 
locus). Interestingly golgin repeats were also found at the break- 
points of pathogenic chromosome 15 deletions (F Antonacci, MY 
Dennis, J Huddleston, PH Sudmant, K Meltz Steinberg, TA Graves, 
L Vives, M Malig, CT Amemiya, A Stuart, et al., unpubL), empha- 
sizing and confirming the dual effect of SD genomic instability in 
both evolution and disease. 

LCR15 copies are present on the macaque and marmoset 
orthologous regions of 15qll.2 and, therefore, were likely present 
in the Haplorhini ancestor, but expanded in human, possibly as 
a result of the neocentromere emergence. LCR15 sequences are 
present on human and macaque yet absent from marmoset re- 
gions orthologous to the inversion breakpoints that occurred in 
the Catarrhini ancestral submetacentric chromosome (loci AB 
and F), revealing these copies may have emerged as a result of the 
inversion or may have driven the event. 

There is additional evidence of LCR15 evolutionary in- 
stability. The same site at 15qll was shown to be involved in the 
pericentric inversion of chimpanzee 15p and in the insertion of 
a novel SD within the human lineage (Locke et al. 2003). Here, we 
found the occurrence of two independent chromosomal rear- 
rangements in proximity to one site (human 15q26.3) after its 
acquisition of an LCR15 copy — the chromosomal fission in the 
hominoid lineage and the duplicative insertion from chromosome 
13 in the macaque lineage. This suggests that LCR15 may not only 
serve as preferential sites of chromosomal breakage and rear- 
rangement, like the chimpanzee inversion, but they may be sub- 
ject to further expansion and duplication associated with recurrent 
events, such as the duplicate copies mapping to the inversion 
breakpoints in the Catarrhini ancestor. 

In FISH experiments, LCR15 signals observed on marmo- 
set metaphases mapped mainly to the pericentromeric regions of 
chromosomes 6p and lOp — particularly lOp — orthologous to hu- 
man 15q24-q25 (loci Dl, D2, DEI, and DE2), suggesting that the 
ancestral pericentromeric region was probably the seed of LCR15 
and the first chimeric ITSN2/DNM1-G0LGA2 unit likely formed at 
this locus in the Haplorhini ancestor. Moreover, since no copy is 
present in the marmoset region orthologous to human chromo- 
some 10, this copy emerged in the Catarrhini ancestor; therefore, 
it was unlikely the site where the first combined unit formed as 
previously suggested (Zody et al. 2006). 

The chromosome 14 pericentromeric region consists of SDs 
from the pericentromeric region of ancestral hominoid 
chromosomes 2B and 10 

After the fission event, human chromosome 14 acquired —19 Mb 
of sequence over the last 25 million years. It is largely composed of 
three blocks: (1) a 1.3-Mb transition region between the q arm 
euchromatin and the centromere; (2) a 3-Mb centromere com- 
posed of alphoid DNA; and (3) a 15-Mb heterochromatic short arm 
containing arrays of ribosomal RNA genes, satellite sequences, and 
other repetitive elements (Kehrer-Sawatzki et al. 1998). We ana- 
lyzed the first of these three blocks and showed that it was mainly 
formed through pericentromeric duplicative transpositions from 



ancestral hominoid chromosomes 2B and 10 (homologous to 
human 2ql4-qter and 10, respectively). We note that the centro- 
mere of chromosome 2B inactivated in both the macaque (Ventura 
et al. 2007) and human lineages — the latter after a fusion event 
occurred 3-5 million years ago and mapping to 2q21 (IJdo et al. 
1991). The pericentromeric sequences on chromosome 10 were 
lost in human and African ape, and evidence of their ancestral 
location on this chromosome still exists in orangutan and ma- 
caque. Notably, chromosomes 2B and 10 have been partners in 
another process during great ape evolution — the formation of 
subterminal heterochromatin in the chimpanzee and gorilla line- 
ages (Ventura et al. 2012) — ^but the genomic regions involved in 
the process were different. 

STS derived from the new human chromosome 14 pericen- 
tromeric sequence and probed against the macaque genomic li- 
brary retrieved clones mapping to either pericentromeric or eu- 
chromatic regions. Among the latter, the euchromatic SD-rich 
region at human 10q22.3 is associated with chromosomal rear- 
rangement and human disease (Melberg et al. 1999; Quintero-Del- 
Rio et al. 2002; Fallin et al. 2003; Faraone et al. 2006; Hofmann 
et al. 2008; Kuhl et al. 2008; Venken et al. 2008; O'Meara et al. 
2012); in the gorilla genome, one breakpoint of the inversion of 
chromosome lOq is 1 Mb proximal to this locus (Carbone et al. 
2002; Ventura et al. 2011). 

In summary, we propose a model that describes the evolu- 
tion of ape chromosomes 14 and 15 (Fig. 5). After the fission, the 
resulting two new chromosomal ends achieved different func- 
tional roles with respect to the chromosome. The chromosome 
14 portion acquired the centromeric functionality necessary for 
daughter cell segregation, resulting in a telocentric chromosome, 
while the chromosome 15 end became a telomere participating in 
interchromosomal DNA exchanges with other subtelomeric re- 
gions. Both novel chromosome ends earned telomeric repetitive 
sequences (TTAGGG) to be protected from degradation and to 
prevent chromosomal end-to-end fusions. The original centro- 
mere, located on chromosome 15, became inactive and a neo- 
centromere was seeded at the chromosome tip not derived by the 
fission, also making chromosome 15 a telocentric chromosome. 
The neocentromeres rapidly acquired the complex organization 
characteristic of primate centromeres — a large core of alphoid se- 
quences surrounded by SDs (Eichler et al. 1999). SDs mainly from 
the long arm pericentromeric region of the acrocentric chromo- 
some 2B and from the pericentromeric region of chromosome 10 
contributed to the formation of the 14q pericentromeric region. 
SDs involving this genomic region continue to accumulate in the 
human lineage, mainly within the long arm pericentromeric re- 
gion of chromosome 22 (Bailey et al. 2002b). It is known that the 
juxtaposition of constitutive heterochromatin and transcription- 
ally active genes induces position effects, effectively silencing 
transcription (Dillon and Festenstein 2002). The SDs served the 
potentially important role of buffering chromosome 14 euchro- 
matin and the novel centromeric heterochromatin, possibly pro- 
tecting and maintaining the transcription regulatory landscape of 
the 14qll genes suddenly found in a pericentromeric region. 
Chromosome 15 experienced further LCR15 duplications that 
generated new copies, particularly within the novel pericentro- 
meric region. In the human karyotype, the neocentromere of 
chromosome 14 acquired alphoid DNA of the suprachromosomal 
families 2, 4, and 5; the one of chromosome 15 acquired alphoid 
DNA of the suprachromosomal families 2 and 4 (Alexandrov et al. 
2001). Finally, duplications from the p arm of acrocentric chro- 
mosomes created the heterochromatic p arm of ribosomal RNA 
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Figure 5. Model of the evolution of ape chromosomes 14 and 15. The picture highlights the centromere repositioning on chromosome 15, the 
neocentromere formation on chromosome 14, the sequence duplications from and to the novel pericentromeric and subtelomeric regions, and 
the acquisition of telomeric repeats, rDNA, and alphoid DNA that occurred in the last 25 million years of human evolution. The pericentric inversions in 
the chimpanzee and gorilla lineages caused the loss of the rDNA from the inverted chromosomes. 



genes, turning chromosomes 14 and 15 from telocentric to 
acrocentric chromosomes. We note that in human, chimpanzee, 
and orangutan, only acrocentric chromosomes bear the rDNA on 
the p arm, implying that the acquisition of the rDNA short arm 
was an obligated fate for chromosomes 14 and 15. Yet, in the 
gorilla genome only two acrocentric chromosomes, 21 and 22, 
carry the rDNA while the short arms of the remaining acrocentric 
chromosomes are composed mainly of satellite III sequences 
(Ventura et al. 2012). 

This work highlights the dynamic and intimate relationship 
between duplication and rearrangement in the evolution of hu- 
man chromosomes 14 and 15. It raises the possibility that not only 
duplication triggers chromosomal rearrangement through NAHR 
but also that chromosomal breakage gives rise to duplication events 
(Kehrer-Sawatzki and Cooper 2008; Girirajan et al. 2009). We propose 
a possible additional role of SDs in genome evolution where the 
occurrence of duplication events in response to chromosomal 
breakage might represent a possible "feedback mechanism" that 
produces additional DNA material to recover the damage caused 
by the breakage. 

Methods 

Hybridization of high-density filters 

Radioactive genomic hybridization of the sixfold coverage CHORI- 
250 (rhesus macaque) segment 1 BAG library was carried out 
according to the protocol available at GHORI BAGPAG resources 
(http://bacpac.chori.org/highdensity.htm). Probes were obtained 
by means of PGR amplification of macaque genomic DNA (Sup- 
plemental Table S4). The number of positive clones provided es- 
timation of the copy number in the macaque genome. EES were 
repeatmasked (Smit and Hubley 2010) and mapped on macaque 



(rheMac2) and human (hgl8) reference genomes using the BLAT 
tool at the UGSG Genome Browser (http://genome.ucsc.edu/cgi-bin/ 
hgBlat). 

Fluorescence in situ hybridization [FISH] 

Metaphase spreads and interphase nuclei were prepared from 
lymphoblastoid or fibroblast cell lines of Homo sapiens (HSA), 
Pan troglodytes (PTR), Gorilla gorilla (GGO), Pongo pygmaeus 
(PPY), Macaca mulatta (MMU), Cercopithecus aethiops (GAE), 
Trachypithecus cristatus (TGR), and Callithrix jacchus (GJA). FISH 
experiments were performed using BAG clones (300 ng) directly 
labeled by nick-translation with Gy3-dUTP, Gy5-dUTP, and 
fluorescein-dUTP as previously described (Lichter et al. 1990) with 
minor modifications. Hybridization was performed at 37°G in 2x 
SSG, 50% (v/v) formamide, 10% (w/v) dextran sulfate, 3 |jLg GOt-1 
DNA, and 3 fxg sonicated salmon sperm DNA, in a volume of 10 |jlL. 
Post-hybridization washing was at high stringency (60°G in 0.1 X 
SSG, three times) or at low stringency for cross-species hybrid- 
ization (37°G in 2X SSG, 50% formamide, three times, and 42°G 
in 2x SSG, three times). Nuclei and chromosome metaphases 
were DAPI-stained. Digital images were obtained using a Leica 
epifluorescence microscope equipped with a cooled GGD cam- 
era. Fluorescence signals detected with Gy3, Fluorescein, and 
Gy5 filters and chromosomes and nuclei images detected with 
DAPI filter were recorded separately as grayscale images. Pseu- 
docoloring and merging of images were performed using Adobe 
Photoshop software. 

Phylogenetic analysis 

Duplicate copies of a 3-kb region from ITSN2 and a 2.5-kb region 
from G0LGA2 were retrieved from the hgl9, panTro3, gorGor3, 
ponAbe2, rheMac2, and caljac3 reference genomes through BLAT 
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search (http://genome.ucsc.edu/cgi-bin/hgBlat) (Kent 2002). The 
hgl9 reference was used instead of hgl8 because in the most recent 
release of the human genome (hgl9), all LCR15 duplicons are 
placed on the chromosome 15 sequence, and there are no chrl5_ 
random locations. Multiple sequence alignments were performed 
using Clustal W (Thompson et al. 1994) (match 5, mismatch -2, 
match with N 0, gap opening 25, gap extension 0.05 for the ITSN2 
region; match 4, mismatch -2, match with N 0, gap opening 15, 
gap extension 0.05 for the G0LGA2 region). Neighbor- joining 
phylogenetic trees based on the Kimura 2-parameter model 
(Kimura 1980) (pairwise deletion option) were constructed in 
MEGA5 (Tamura et al. 2011). A bootstrap test with 1000 replicates 
was conducted to evaluate the statistical significance of each node 
(Felsenstein 1985). 

Sanger sequencing 

Ten STS regions were PCR-amplified from positive BAG clones and 
PGR products were sequenced using PGR primers (Supplemental 
Table S4). DNA from BAG clones were extracted using the BAG Prep 
Protocol (Schein et al. 2004), and BAG ends were sequenced (if not 
available in the NGBI Trace Archive) using SP6 and T7 primers. 
Sequencing was performed on an ABI PRISM 3100 Genetic Ana- 
lyzer using the BigDye Terminator v3.1 Ghemistry (Applied Bio- 
systems) according to manufacturer instructions. 

Illumina sequencing and analysis of BAG clones 

BAG clone DNA from the GH250 macaque library was isolated by 
a modified alkaline lysis miniprep procedure and paired-end se- 
quenced on an Illumina HiSeq 2000 (Supplemental Note). Short 
Illumina reads (50 bp) were mapped to the hard-masked references 
hglS and rheMac2 with mrsFAST (Hach et al. 2010) using an edit 
distance of three and single-end mapping mode. We determined 
the best location for each clone using the coverage of reads across 
each genome. For each clone we calculated the mean and standard 
deviation of its coverage and selected regions for which the cov- 
erage was greater than the mean plus two standard deviations and 
the total size of the region was >1 kbp. We manually investigated 
each of the resulting regions in the UGSG Genome Browser to 
determine the best location and whether any regions should be 
merged. We determined the repeat content of short reads by 
aligning reads to all known RepeatMasker and Tandem Repeat 
Finder (TRF) sequences in hglS and rheMac2. Repetitive co- 
ordinates were obtained from UGSG Genome Browser tracks 
(rmsk.txt.gz for RepeatMasker and chromTrf.tar.gz for TRF) and 
corresponding sequences were extracted from each reference to 
create a "repeat" reference. We mapped short reads to these repeat 
references using BWA (Burrows-Wheeler Alignment) in single- 
end mapping mode and counted the total number of reads that 
mapped to each reference with at least a quality of 1 . To determine 
the amount of vector sequence or bacteria contamination present 
in the BAG short reads, we aligned these reads to a reference 
consisting of the vector pTARBAG2.1 and the Escherichia coli 
strain DHIOB using mrsFAST with an edit distance of three and 
single-end mapping mode. We counted the number of reads that 
mapped to the vector and the E. coli genome with a quality of at least 
1 to determine the proportion of vector or E. coli reads in the com- 
plete set of reads. 

Data access 

BAG end sequences are available in the Genome Survey Sequence 
division of GenBank (http://www.ncbi.nlm.nih.gov/nucgss) un- 
der accession numbers KG088791-KG088955. GHORI-250 BAG 



clone sequence data are available in the Sequence Read Archive 
(SRA; http://www.ncbi.nlm.nih.gov/sra) under accession number 
SRP028268. 
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